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Abstract. We study various complexity properties of sufBx-free regular languages. The quo¬ 
tient complexity of a regular language L is the number of left quotients of L; this is the same 
as the state complexity of L, which is the number of states in a minimal deterministic finite 
automaton (DFA) accepting L. A regular language L' is a dialect of a regular language L if 
it differs only slightly from L (for example, the roles of the letters of L' are a permutation of 
the roles of the letters of L). The quotient complexity of an operation on regular languages 
is the maximal quotient complexity of the result of the operation expressed as a function of 
the quotient complexities of the operands. A sequence {Lk, Lk+i, ...) of regular languages in 
some class C, where n is the quotient complexity of Ln, is called a stream. A stream is most 
complex in class C if its languages L„ meet the complexity upper bounds for all basic mea¬ 
sures, namely, they meet the quotient complexity upper bounds for star and reversal; they 
have largest syntactic semigroups; they have the maximal numbers of atoms, each of which 
has maximal quotient complexity; and (possibly together with their dialects L'^) they meet 
the quotient complexity upper bounds for boolean operations and product (concatenation). 
It is known that there exist such most complex streams in the class of regular languages, in 
the class of prefix-free languages, and also in the classes of right, left, and two-sided ideals. 
In contrast to this, we prove that there does not exist a most complex stream in the class of 
sufRx-free regular languages. However, we do exhibit one ternary suffix-free stream that meets 
the bound for product and whose restrictions to binary alphabets meet the bounds for star 
and boolean operations. We also exhibit a quinary stream that meets the bounds for boolean 
operations, reversal, size of syntactic semigroup, and atom complexities. Moreover, we solve 
an open problem about the bound for the product of two languages of quotient complexities 
m and n in the binary case by showing that it can be met for infinitely many m and n. Two 
transition semigroups play an important role for suffix-free languages: semigroup T^®(n) is 
a suffix-free semigroup that has maximal cardinality for 2 Si n ^ 5, while semigroup T^®(n) 
has maximal cardinality for n = 2, 3, and n 6. We prove that all witnesses meeting the 
bounds for the star and the second witness in a product must have transition semigroups 
in T^®(n). On the other hand, witnesses meeting the bounds for reversal, size of syntactic 
semigroup, and the complexity of atoms must have semigroups in T^®(n). 

Keywords: most complex, regular language, state complexity, suffix-free, syntactic complex¬ 
ity, transition semigroup 


1 Introduction 

We study complexity properties of suffix-free regular languages. A much shorter preliminary version 
of these results appeared in m without any proofs. 


Motivation The state complexity k{L) of a regular language L over an alphabet S is the number 
of states in a minimal deterministic finite automaton (DFA) with input alphabet S recognizing 
L. The state complexity of a regularity preserving unary operation o on regular languages is the 
maximal value of k{L^) as a function of n, where L„ varies over all regular languages with 
state complexity n. Similarly, the state complexity of a regularity preserving binary operation o on 
regular languages is the maximal value of K{Km o L^) as a function of m and n, where and 
vary over all regular languages of state complexities m and n, respectively. Of special interest are 
the state complexities of common operations on regular languages. The state complexities of union, 
product (concatenation), (Kleene) star and reversal were studied by Maslov [32] in 1970, but this 
work was not well known for many years. The paper by Yu, Zhuang, and Salomaa [35] inspired 
considerable interest in these problems and much research has been done on this topic in the past 
22 years. The state complexity of an operation gives a worst-case lower bound on the time and 
space complexities of the operation. For this reason it has been studied extensively; see [5127130] 
for additional references. 

Consider, for example, the product of two regular languages. To find its state complexity we 
need to establish an upper bound for it, and find two languages of state complexities m and n, 
respectively that meet this bound. It is known that (m — 2)2" -|- 2"“^ is a tight upper bound on 
product. The languages that meet this bound are called witnesses. In general, different witnesses 
have been used for the two arguments and for different operations. However, Brzozowski [B] has 
shown that one witness and its slightly modified version called a dialect is sufficient to meet the 
bounds on all binary boolean operations, product, star, and reversal. The DFA of this witness is 
shown in Figured) Let the language recognized by this DFA be L„(a, 6, c), and let L„(&, a,c) be 
the language of the DFA obtained from that of Figure [T] by interchanging the roles of a and b. 
Then Lm{a,b,c) and Ln{b,a,c) meet the bound mn for all binary boolean operations, Lm{a,b,c) 
and Ln{a,b,c) meet the bound (m — 1)2" -|- 2"“^ for product, and Ln{a,b,c) meets the bounds 
2"“^ -I- 2"“^ for star and 2" for reversal. 


c c b,c b b,c 



Although state complexity is a useful measure, it has some deficiencies. The language L„(a, b, c) 
and the language {a, 6, c}"“^ both have state complexity n, but the second language is intuitively 
much simpler. Secondly, all the bounds for common operations - except reversal, which misses the 
bound by one state - are also met by star-free languages [T3], where the class of star-free lan¬ 
guages is the smallest class containing the finite languages and closed under boolean operations and 
product, but not star. For these reasons Brzozowski [^ suggested additional complexity measures: 
the maximal size of the syntactic semigroup of the language and the state complexities of atoms 
(discussed later). It turns out that the language defined in Figure [T] also meets these bounds. For 
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these reasons this language has been called a most complex regular language. The added measure 
of the size of the syntactic semigroup distinguishes well between Ln{a,b,c) and {a,b,c}‘^~^ since 
the semigroup of the first language has n" elements, while the second has n — 1 elements. 

In previous work on state complexity it was always assumed that the two arguments in product 
and binary boolean operations are restricted to be over the same alphabet. But Brzozowski pointed 
out in 2016 [7] that operations on languages over different alphabets, unrestricted operations, have 
higher tight upper bounds. It was shown that witnesses very similar to those of Figure [I] also meet 
these bounds mm, and thus are most complex for both restricted and unrestricted operations. For 
suffix-free languages the restricted and unrestricted complexities are the same. 

Most complex languages are useful for testing the efficiency of systems. Hence to check the 
maximal size of the objects that a system can handle, we can use most complex languages. It is 
certainly simpler to have just one or two worst-case examples. 

The question now arises whether most complex languages also exist in subclasses of regular 
languages. A natural source of subclasses is obtained from the notion of convexity. Convex languages 
were introduced by Thierrin [34] and studied later in [1]. They can be defined with respect to 
arbitrary binary relations on an alphabet E* , but the relations “is a prefix of”, “is a suffix of” and 
“is a factor of” turned out to be of considerable interest, where if re = xyz G S* , then cc is a prefix 
of w, y is a factor, and z is a suffix. A language L is prefix-convex if, whenever w = xyz and w and 
X are in L, then so is xy. Factor-convex and suffix-convex languages are defined in a similar way. 
Brzozowski began studying the complexity properties of these languages in |4] . 

The class of prefix-convex languages has four natural subclasses: right ideals (languages L sat¬ 
isfying L = LS*), prefix-closed languages (where w in L implies that every prefix of w is in 
L), prefix-free languages (where m in L implies that no prefix of w other than w is in L), and 
prefix-proper languages that are prefix-convex but do not belong to any one of the three special 
subclasses. Similarly, there are four subclasses of suffix-free languages: left ideals (languages L 
satisfying L = S*L), suffix-closed languages, suffx-free languages, and suffx-proper languages. 
Finally, there are four subclasses of factor-free languages: two-sided ideals (languages L satisfying 
L = E*LE*), factor-closed languages, factor-free languages, and factor-proper languages. Ideals 
appear in pattern matching |24] : if we are looking for all the words in a given text that begin with 
words in a pattern language L, then we are dealing with the right ideal LE*, and similar state¬ 
ments apply to left and two-sided ideals. Prefix-closed (respectively, suffix-closed, factor closed) 
languages are complements of right ideals (respectively, left ideals, two-sided ideals). Prefix-free 
(respectively, suffix-free, factor-free) languages, other than the language consisting of the empty 
word, are codes [3], and have many applications, particularly in cryptography, data compression 
and error correction. 

Most complex left, right and two-sided ideals for restricted operations were found in [10], and 
unrestricted operations were added in m- Most complex prefix-closed, and prefix-free languages 
were exhibited in m and proper prefix-convex languages in |15j . Most complex suffix-closed lan¬ 
guages were found in M- In contrast to these results, the first example of a subclass of the class of 
regular languages that does not have a most complex language is that of suffix-free languages. This 
result was reported in ED], and is the subject of the present paper. It is also known (unpublished 
result) that most complex proper suffix-convex languages do not exist. 

Quotient Complexity A basic complexity measure of a regular language L over an alphabet E 
is the number n of distinct left quotients of L, where a (left) quotient of L by a word w G E* 
is w~^L = {x I wx G L}. We denote the set of quotients of L by AT = {Kq, ..., K^-i}, where 
Kq = L = e~^L by convention. Each quotient can be represented also as w~^L, where Wi G E* 
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is such that w~^L = Ki. The number of quotients of L is its quotient complexity [5]. Since the set 
of quotients of L is the same as the number of states in a minimal DFA recognizing L, the quotient 
complexity of L is the same as its state complexity; however, quotient complexity suggests language- 
theoretic methods, whereas state complexity deals with automata. The quotient complexities of 
unary operation and binary operations are defined analogously to state complexities. 

To establish the state/quotient complexity of a regularity preserving unary operation o we need a 
sequence {Ln,n ^ k) = {Lk,Lk+i, ■■ ■), called a stream, of witness languages that meet this bound; 
here k is usually some small integer because the bound may not apply for n < fc. The languages 
in a stream are normally defined in the same way, differing only in the parameter n. For example, 
the languages of the DFAs of Figured] define the stream {Ln{a,b,c) | n ^ 3). Similarly, to establish 
the state/quotient complexity of a regularity preserving binary operation o on regular languages 
we have to find two streams ^ h) and {Ln,n ^ fc) of languages meeting this bound. 

The state/quotient complexity of suffix-free languages was examined in [23125129] . 

We also extend the notions of maximal complexity, stream, and witness to DFAs. 

Syntactic Complexity A second measure of complexity of a regular language is its syntactic 
complexity. Let be the set of non-empty words of E*. The syntactic semigroup of L is the 
set of equivalence classes of the My hill congruence on E'^ defined by a; 2/ if and only if 
uxv G L uyv G L for all u,v € E* . The syntactic semigroup of L is isomorphic to the transition 
semigroup of a minimal DFA T> recognizing L [33j . which is the semigroup of transformations of 
the state set of "D induced by non-empty words. The syntactic complexity of L is the cardinality of 
its syntactic/transition semigroup. 

Holzer and Konig [26j . and independently Krawetz, Lawrence and Shallit m studied the syntac¬ 
tic complexity in the classes of unary and binary regular languages. This problem was also solved 
for the classes of right ideals [16122] . left ideals [16118122] . two-sided ideals [16118122] . prefix-free 
languages [12], and suffix-free languages [unj. 

Complexities of Atoms A possible third measure of complexity of a regular language L is the 
number and quotient complexities, which we call simply complexities, of certain languages, called 
atoms, uniquely defined by L. Atoms arise from an equivalence on E* which is a left congruence 
refined by the My hill congruence, where two words x and y are equivalent ii ux G L if and only if 
uy & L for all u G E* [28]. Thus x and y are equivalent if a; G u~^L y G u~^L. An equivalence 
class of this relation is called an atom [21] of L. It follows that an atom is a non-empty intersection 
of complemented and uncomplemented quotients of L. The quotients of a language are unions of 
its atoms. 

Terminology and Notation A deterministic finite automaton (DFA) is defined as a quintuple 
D = (Q, E, S,qo, F), where Q is a finite non-empty set of states, A is a finite non-empty alphabet, 
6: Q X E ^ Q is the transition function, qo G Q is the initial state, and F Q Q is the set of final 
states. We extend i5 to a function 5-. Q x E* ^ Q as usual. A DFA V accepts a word w € E* if 
S{qo, w) G F. The language accepted by V is denoted by L{V). If y is a state of V, then the language 
L'^ of q is the language accepted by the DFA {Q, E, 6, q, F). A state is empty if its language is empty. 
Two states p and q olD are equivalent if = L'^. A state q is reachable if there exists w G E* such 
that 6{qo, w) = q. A DFA is minimal if all of its states are reachable and no two states are equivalent. 
Usually DFAs are used to establish upper bounds on the quotient complexity of operations and also 
as witnesses that meet these bounds. 

A nondeterministic finite automaton (NFA) is a quintuple D = (Q, E,d, I, F), where Q, E and 
F are defined as in a DFA, 6: Q x E 2^ is the transition function, and / C Q is the set of initial 
states. An e-NFA is an NFA in which transitions under the empty word e are also permitted. 
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The quotient DFA of a regular language L with n quotients is defined hy'D= (K, E, Sxi, Kq, F-p), 
where 6p{Ki,w) = Kj if and only if w~^Ki = Kj, and Fp = {Ki \ s € Ki}. To simplify the 
notation, without loss of generality we use the set Qn = {0,..., n—1} of subscripts of quotients as the 
set of states of then V is denoted hy V — (Qn, E, S, 0, F), where S{p, w) = q ii Sp{Kp, w) = Kg, 
and F is the set of subscripts of quotients in Fp. The quotient DFA of L is unique and it is 
isomorphic to each complete minimal DFA of L. 

A transformation of Qn is a mapping t: Qn Qn- The image of q € Qn under t is denoted by 
qt. The range of t is rng(t) = {q € Qn \ pt = q for some p £ Qn}- In any DFA, each letter a € E 
induces a transformation Sa of the set Qn defined by qSa = S{q, a). By a slight abuse of notation we 
use the letter a to denote the transformation it induces; thus we write qa instead of qSa- We also 
extend the notation to sets of states: if P C Qn, then Fa = {pa \ p £ P}. If s, t are transformations 
of Q, their composition is denoted by s * t and defined by q{s *t) = {qs)t-, the * is usually omitted. 
Then also for a word w = ai - ■ ■ ak, denotes the transformation (5ai • • • Sa^. induced by w. Let 7 q„ 
be the set of all n” transformations of Qn', then 7 q„ is a monoid under composition. 

For fc ^ 2, a transformation (permutation) t of a set P = {qo,qi,... ,qk-i} C Q is a k-cycle 
if qot = qi,qit = q 2 ,---,qk- 2 t = qk-i,qk-it = qo- This fc-cycle is denoted by {qo,qi, - - ■ ,qk-i)- 
A 2-cycle {qo, qi) is called a transposition. A transformation that changes only one state p to a state 
q^ pis denoted by (p —>■ g). A transformation mapping a subset P of Q to a single state and acting 
as the identity on Q \ P is denoted by [P ^ q). We also denote by [go, • ■ ■, gn-i] the transformation 
that maps p £ {0,..., n — 1} to g^. 

We now define dialects of languages and DFAs following m- Let E = {ai,...,afc} be an 
alphabet; we assume that its elements are ordered as shown. Let tt be a partial permutation of E, 
that is, a partial function tt: E ^ F where F C E, for which there exists ACE such that tt is 
bijective when restricted to Z\ and undefined on A \ Z\. We denote undefined values of tt by the 
symbol 

If P is a language over E, we denote it by L{ai,... ,ak) to stress its dependence on E. If 
TT is a partial permutation, let St^ be the language substitution defined as follows: for a G E, 
a !-)• {7r(a)} when 7r(a) is defined, and a<^ % when 7r(a) is not defined. For example, if A = {a, b, c}, 
L{a,b,c) = {a,b,c}*{ab,acc}, and 7r(a) = c, Tr{b) = —, and tt{c) = b, then St^{L) = {b,c}*{cbb}. In 
other words, the letter c plays the role of a, and b plays the role of c. A permutational dialect of 
L{ai,..., Ok) is a language of the form s,r(L(ai,..., Ofc)), where tt is a partial permutation of A; 
this dialect is denoted by L(7r(ai),..., 7r(afc)). If the order on A is understood, we use L{E) for 
L{ai,... ,ak) and L{tt{E)) for P(7r(ai),... ,7r(afe)). Undefined values appearing at the end of the 
alphabet are omitted. For example, if A = {a, b, c, d} then we write L{a, b) instead of L{a, b, —, —). 

Let A = {oi,..., Uk}, and let V = {Q, A, 6, qi,F) be a DFA; we denote it by P(ai,..., Ok) to 
stress its dependence on A. If tt is a partial permutation, then the permutational dialect 

P(7r(ai),...,7r(afc)) 

of P(ai,..., Ok) is obtained by changing the alphabet of V from A to tt{E), and modifying 5 so 
that in the modihed DFA 7r(ai) induces the transformation induced by ai in the original DFA; 
thus TT{ai) plays the role of at. One verifies that if the language L{ai,... ,ak) is accepted by DFA 
P(ai,..., Ok), then P(7r(ai),..., 7r(afc)) is accepted by P(7r(ai),..., 7r(afc)). 

In the sequel we refer to permutational dialects simply as dialects. 

Contributions 

1. We prove that a most complex stream of suffix-free languages does not exist. This is in con¬ 
trast with the existence of streams of most complex regular languages, right, left, and two-sided 
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ideals, prefix-free and proper prefix-convex languages. 

2. We exhibit a single ternary witness that meets the bounds for star, product, and boolean 
operations. 

3. We exhibit a single quinary witness that meets the bounds for boolean operations, reversal, 
number of atoms, syntactic complexity, and quotient complexities of atoms. 

4. We show that when m, n ^ 6 and m — 2 and n — 2 are relatively prime, there are binary 
witnesses that meet the bound (m — 1)2”“^ -|- 1 for product. 

5. We prove that any witness DFA for star and any second witness DFA for product must have 
transition semigroups that are subsemigroups of the suffix-free semigroup of transformations T^^(n) 
which has maximal cardinality for 2 ^ n ^ 5; that the witness DFAs for reversal, syntactic com¬ 
plexity and quotient complexities of atoms must have transition semigroups that are subsemigroups 
of the suffix-free semigroup of transformations T^®(n) which has maximal cardinality for n = 2, 3 
and n ^ 6; and that the witness DFAs for boolean operations can have transition semigroups that 
are subsemigroups of n 

2 SufRx-Pree Transformations 

In this section we discuss some properties of sufHx-free languages with emphasis on their syntactic 
semigroups as represented by the transition semigroups of their quotient DFAs. We assume that 
our basic set is always Qn = {0,..., n — 1}. 

2.1 SufRx-Pree Languages 

Let Vn = {Qm 27, (5, 0, F) be the quotient DFA of a suf&x-free language L, and let T„ be its transition 
semigroup. For any transformation t of Qm the sequence (0, Of, Ot^,...) is called the 0-path of t. 
Since Qn is finite, there exist i, j such that 0, Of,..., 0t\ ..., 0P~^ are distinct but OP = OP. 

The integer j — i is the period of t and if j — * = 1, t is initially aperiodic. The following properties 
of sufHx-free languages are known 

Lemma 1. If L is a regular suffix-free language, then 

1. There exists w € E* such that w~^L = 0; hence has an empty state, which is state n — 1 by 
convention. 

2. For w,x G 27+, if w~^L ^ 0, then w~^L ^ {xw)~^L. 

3. If L ^ $ and w~^L = L, then w = e. 

4- For any t G Tn, the 0-path oft in T>n is aperiodic and ends inn — 1. 

Property 3 is known as the non-returning property [25) and also as unique reachability m- 
An (unordered) pair {p, g} of distinct states in Qn \ {0, n — 1} is colliding (or p collides with q) 
in Tn if there is a transformation t G Tn such that Ot = p and rt = q for some r G Qn \ {0, n — 1}. 
A pair of states is focused by a transformation u of Qn if u maps both states of the pair to a single 
state r ^ {0, n — 1}. We then say that {p, q} is focused to state r. If L is a suffix-free language, then 
from Lemma [T] (2) it follows that if {p, q} is colliding in Tn, there is no transformation P G Tn that 
focuses {p, q}. So colliding states can be mapped to a single state by a transformation in T„ only if 
that state is the empty state n — 1. 

Following [12], for n ^ 2, we let 

B(n) = {t gTq \ 0 ^ rng(f), (n — l)t = n — 1, and for all j > 1, 
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Ot^ = n — 1 or Ot^ ^ qt^, Vg such that 0 < 9 < n — 1}. 


Example 1. We have B(2) = {[1,1]} and B(3) = {[1, 2, 2], [2,1, 2], [2, 2, 2]}. For n = 4, there are 17 
transformations satisfying Ot ^ qt. However, if t = [1, 2, 2, 3] or < = [2,1,1, 3], then , which 

violates the third condition for B; hence B(4) has 15 elements. The cardinality of B(5) is 115. ■ 

Proposition 1 im ). If L is a regular language with = (Qn, E,S,0, F) as its minimal DFA 
and syntactic semigroup T^, then the following hold: 

1. If L is suffix-free, then Tl is a subset o/B(n). 

2. If L has the empty quotient, only one final quotient, and C B(n), then L is suffix-free. 

Since the transition semigroup of a minimal DFA of a sufHx-free language must be a subsemi¬ 
group of B(n), the cardinality of B(n) is an upper bound on the syntactic complexity of suffix-free 
regular languages with quotient complexity n. This upper bound, however, cannot be reached since 
B is not a semigroup for n > 4: We have s = [1, 2, n — 1,..., n — 1] and t = [n — 1, 2, 2,..., 2, n — 1] 

in B(n), but st = [2, 2, n — 1,..., n — 1] is not in B(n). 

2.2 Semigroups T^®(n) with MELximal Cardinality when 2 ^ n ^ 5 

For n ^ 2, let 

T^®(n) = {t G B(n) | for all p,q G Qn where p ^ q, either pt = qt = n — 1 ox pt ^ qt}. 

Proposition 2. For n ^ 4, the semigroup T^®(n) is generated by the following set H ^^{n) of 

transformations of Q: 

— a: (O^n — l)(l,...,n — 2), 

— b: (0^>n-l)(l,2), 

— for 1 < p < n — 2, Cpi (p ^ n — 1)(0 —5> p). 

Forn = 4, a and b coincide, and so H^®(4) = {a, ci, C 2 }. Also, H^®(3) = {a, ci} = {[2,1, 2], [1, 2, 2]} 
and H^5(2) = {ci} = {[1,1]}. 

Proof. It was proved in m that for n ^ 4, the semigroup T^^(n) is generated by the following set 
G<5 (n) of n transformations of Qn'. a and b as above, and Cp for 1 ^ p ^ n — 2, defined by qc'p = q-fl 
for g = 0,... ,p — 1, pCp = n — 1, and qc'p = q tor q = p -\- 1... ,n — 1. Since H^®(n) C the 
semigroup generated by H^®(n) is a subsemigroup of T^®(n). So it is sufficient to show that every 
transformation in G^^(n) can be generated by H'^®(n). Transformation Cp changes {0,1,... ,p— 1} 
to {1, 2,... ,p}, p is mapped to n — 1, and {p -I- l,p -I- 2,..., n — 2) is mapped to itself. The image 
of Qn \ {p, n — 1} is thus {1, 2,... ,p,p -|- 1,... n — 2}. The image of Qn \ {p,n — 1} under Cp is 
{p, 1,2,... ,p—l,p-|-l,..., n — 2}. However, since transformations a and b restricted to (5„\{0, n —1} 
generate all permutations of Qn \ {0, n — 1}, {p, 1, 2,... ,p — l,p-|- 1,..., n — 2} can be transformed 
to {1, 2,... ,p,p -I- 1,... n — 2}. Hence the claim holds. 

From now on we use the transformations of Proposition [2] for T^5(n). A DFA using these 
transformations is illustrated in Figure [2j 
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Fig. 2. DFA with T^®(n) as its transformation semigroup; Cp = {ci,..., c„_ 2 } \ {cp}. 


Proposition 3. For n ^ 2, T^®(n) is the unique maximal semigroup of a suffix-free language in 
which all possible pairs of states are colliding. 

Proof. For each pair p,q € Q\ {0, n — l},pf^q, there is a transformation Cp € T^^(n) with Ocp = p 
and qcp = q. Thus all pairs are colliding. If all pairs are colliding, then for each p, q G Q \ {n — 1}, 
there is no transformation t with pt = qt n — 1, for this would violate sufSx-freeness. By definition, 
(n) has all other transformations that are possible for a sufHx-free language, and hence is unique. 

Proposition 4. Porn ^ 5, the number n of generators o/T^®(n) cannot be reduced. 

Proof. If a generator t maps OtopS {I,...,n — 2}, then it must also map a state q £ {I,...,n — 2} 
to n — I, since the 0-path of t is aperiodic and ends in n — I. Thus t cannot generate a permutation of 
,n —2}. Since T^®(n) has all transformations that map 0 to n —I, permute {!,... ,n —2} and 
fix n — 1, we need two generators, say a and 6, with Oa = 06 = n — I to induce all the permutations. 
For p G {I,..., n — 2}, consider the transformation Cp of Proposition [5] We need a generator, say 
Op, which maps 0 to a state p in {I,..., n — 2}, and a state q from {I,..., n — 2} to n — I. But since 
Cp maps only two states, q and n — I, to n — 1 and {I,..., n — 2} C rng(cp), the other generators 
involved in the composition of generators that induces Cp do not map any state from {1,..., n — 2} 
to n — I. Since there are n — 2 distinct transformations Cp G T^^(n), one for eachp G {I,...,n —2}, 
we need at least n — 2 generators Op. This gives n generators in total. 

Example 2. T^^(4) has 13 elements. All transitions of B(4) are present except [3,1,1,3] and 
[3, 2, 2, 3]. The semigroup T^^(5) has 73 elements. ■ 

Semigroups T^^(n) are suffix-free semigroups that have maximal cardinality when 2 Gi n ^ 

5 m- 
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2.3 Semigroups T^®(n) with MELximal Cardinality when n — 2,3 and n ^ 6 

For n ^ 2, let 

T^®(n) = {t G B(n) | Ot = n — 1, or qt = n — IV q such that 1 < g ^ n — 2}. 


Proposition 5 t |20| i. For n ^ 4, T^®(n) is a semigroup contained in B(n), its cardinality is 
{n — + (n — 2), and it is generated by the set G^®(n) of the following transformations: 

— a: (0 —>■ n — 1)(1,...,n — 2); 

— b: (0^n-l)(l,2); 

— c: (0 —>■ n — l)(n — 2 ^ 1); 

— d: ({0,1} —>■ n — 1); 

— e: {Q \ {0} —>■ n — 1)(0 —>■ 1). 

For n = 4, a and b coincide, and so G^®(4) = {a, c, d, e}. Also G^®(3) = {a, e} = {[2,1, 2], [1, 2, 2]} 
and G^®(2) = {e} = {[1,1]}. 

Example 3. T^®(4) has 11 elements. All transitions of B(4) are present except [1, 2, 3, 3], [1, 3, 2, 3], 
[2,1, 3, 3] and [2, 3,1, 3]. Semigroup T^®(5) has size 67. ■ 

A DFA using the transformations of Proposition [5] is shown in Figure |31 


a, c 



S 


Fig. 3. DFA with T^®(n) as its transformation semigroup for the case when n is odd. 


Semigroups T^®(n) are suffix-free semigroups that have maximal cardinality when n ^ 6 


3 Witnesses with Transition Semigroups in T'^^(n) 

In this section we consider DFA witnesses whose transition semigroups are subsemigroups of T^®(n). 
We show that there is one witness that satisfies the bounds for star, product and boolean operations. 
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Definition 1. For n ^ 6, we define the DFA = {Qn, S, 0, {!}), where Qn = {0,..., n — 1}, 
S = {a,b,c}, and S is defined by the transformations 

- a: (0-l>n-l)(l,2,3)(4,...,n-2), 

- 5: (2 n - 1)(1 ^ 2)(0 ^ 1)(3,4), 

- c: (O^n — — 2). 

Theorem 1 (Star, Product, Boolean Operations). Let'Dn{a,b,c) be the DFA of Definitions^ 
and let the language it accepts be Ln{a,b^c). For n ^ 6, Ln and its permutational dialects meet the 
bounds for star, product and boolean operations as follows: 

1. L’^(a, b, —) meets the bound 2"’“^ + 1. [Cmorik and Jiraskova P5] ] 

2. Lm{a, b, c) • Ln{h, c, a) meets the bound (m — 1)2"“^ + 1. 

3. Lm{a, b, —) and Ln{—, b, a) meet the bounds mn— (m+n—2) for union and symmetric difference, 
mn — 2(m + n — 3) for intersection, and mn — (m + 2n — 4) for difference. 

The claim about the star operation was proved in |23] . We add a result about the transition 
semigroup of the star witness and prove the remaining two claims in this section. 

3.1 Star 

In 2009 Han and Salomaa [25] showed that the language of a DFA over a four-letter alphabet meets 
the bound 2"“^ -1- 1 for the star operation for n A. The transition semigroup of this DFA is 
a subsemigroup of T^®(n). In 2012 Cmorik and Jiraskova [23] showed that for n ^ 6 a binary 
alphabet {a, 6} suffices. The transition semigroup of this DFA is again a subsemigroup of T^®(n). 
We prove that these are special cases of the following general result: 

Theorem 2. For n ^ 4, the transition semigroup of a minimal DFA D{Qn, S, 6, 0, F) of a suffix- 
free language L that meets the bound 2"“^ + 1 for the star operation is a subsemigroup o/T^®(n) 
and is not a subsemigroup o/T^®(n). 

Proof. To show that the transition semigroup of U is a subsemigroup of T^®(n), it suffices to show 
that every pair of states is colliding. 

We construct an NFA Af for L* by making 0 a final state in - this is possible since 0 is 
uniquely reachable - and adding an empty-word transition from every final state to 0. We then 
determinize Af using the subset construction to get a DFA V* for L*. The states of V* are sets of 
states of V. 

Consider a subset S C Q^. We can assume that n — 1 £ S, since S and S' U {n — 1} cannot be 
distinguished. If S {0, n— 1} and S {n— 1}, then S can be reached only if 0 G S and SCiF y^ 0, 
or 0 ^ S and SOF — (h, because by the construction for star in Af there is an e-transition from every 
final state to the initial state 0 and no transformation fixes 0. Thus, to meet the bound 2"’“^ + 1, 
for each possible subset of{l,...,n — 2} there must be a reachable subset S containing that subset. 

Suppose that p,q G {1,..., n — 2} are not colliding, that is, there is no transformation t with 
Ot = p and q G rng(t). Consider S that contains both p and q. Since the e-transitions from final 
states to initial state 0 are the only sources of nondeterminism, S must be reached from a subset 
S' containing 0 and q' G {l,...,n — 2} by a transformation t with t)t = p and q't = q (or Ot = q 
and q't = p), which contradicts that p, q are not colliding. 

Since transformations causing colliding pairs are not in T^®(n), the transition semigroup of D 
cannot be a subsemigroup of T^®(n). 
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3.2 Product 


To avoid confusing the states of the two DFAs in a product, we label the states of the first DFA differ¬ 
ently. Let V'^ = V'^{a,b,c) = where Q'^ = {O',..., (m-1)'}, and 5'{q',x) = p' 

if S{q, x) = p, and let = T^nib, c, a). We use the standard construction of the e-NFA Af for the 
product: the final state of becomes non-final, and an e-transition is added from that state to 
the initial state of This is illustrated in Figure S] for m = 9,n = 8. 



Fig. 4. The NFA Af for product LQ(a,b,c) ■ Ls{b,c,a). The empty states 8' and 7 and the transitions to 
them are omitted. 


We use the subset construction to determinize Af to get a DFA V for the product. The states 
of V are subsets of Q'^ U Qn and have one of three forms: {O'}, {l',0} U S' or {p'} U S, where 
p' — 2'{m — 1)' and S C {1, ..., n — 1}. 

Note that for each x G S every state q G Qn \ {0,n — 1} has a unique predecessor state 
P G Qn \ {n — 1} such that px = q. For w G S*, the rc-predecessor of S C Q„ \ {0, n — 1} is denoted 
by Sw~^. 


Lemma 2. For each n ^ 6 and each q G Qn there exists a word Wq G c{a,b}* such that Vwq = 3', 
Qwq = q, and each state of Qn \ {0, g, n — 1} has a unique Wq-predecessor in Qn \ {0, n — 1}. In fact, 
the following Wq satisfy these requirements: 
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(1) 


cab^, 
ca, 

Wq = < cab'^, 


cab^a^'i-'^, 


ifq = 1 ; 
ifq = 2 ; 
ifq = 3; 

if q ^ A and q is even; 
if q ^ 5 and q is odd. 


Proof. It is easily verified that in each case Vwq = 3^ and Owq = q. Note that a and b induce 
permutations on Qn \ {0, n — 1} and c is a one-to-one mapping from Qn \ {0, n — 1} to Qn \ {0,1}. 
Thus every state in Qn \ {0, n — 1} that is mapped by Wq to Qn \ {0, n — 1} has a Wg-predecessor 
in Qn \ {0, n — 1}, and state q has 0 as its rcg-predecessor. 

Theorem 3 (Product: Ternary Case). For m,n^ 6, the product L'n^{a,b,c) ■ Ln{b,c,a) meets 
the bound (m — 1)2"“^ -|- 1. 

Proof. Let P consist of the following states {O'}, 2"“^ sets of the form {!', 0,n — 1} U S' and 
(m — 2)2"’“^ sets of the form jp', n — 1} U S, where S C {1,..., n — 2}, and p' = 2',..., (m — 1)' - a 
total of (m — 1)2"’“^ -|- 1 sets. We shall prove that all these states of V are reachable and pairwise 
distinguishable. This together with the known upper bound will prove that the witnesses indeed 
meet the bound for product. 

Consider the distinguishability of two states in P. If we apply any word ending in c to any 
subset of Qn \ (Oj, the resulting set does not contain the final state I. Suppose that one of the 
states in our pair is {O'}; this is the only state accepting be. Next, if one of the states has the form 
{!', 0, n — 1} U S' and the other is {p', n — 1} U i?, then the former state accepts c, whereas the latter 
does not. If p < q, then {p', n — 1} U i? is distinguished from {q', n — 1} U S by This leaves 

the case where the state in \ {O'} is the same in both sets in our pair. If the sets in Qn are R 
and S, S, and q € R(B S, then distinguishes these states. 

Now we turn to reachability. Since state 0' is initial in A/", {O'} is reachable. First we show that 
the sets in P are reachable if S' = 0. The set {l',0,n — 1} is reached by ba^, {2',n — 1} by ba, 
{p', n — 1} by bacP~^ for p = 3,..., (m — 2), and {(m — 1)', n — 1} by b^. 

Now suppose all sets of the form {!', 0, n — 1} U S and {p', n — 1} U S, p = 2,..., m — 1, with 
S C {1,..., n — 2} and |S| = k, are reachable. We show that if fc < n — 2, then every set with S of 
size fc -I- I can be reached. In each case we assume that q ^ S. 

1. Sets with 3'. We add g to S by applying Wq to ({!', 0, n — 1} U SWq^). By LemmaO every state 

except q in Qn \ {0, n — 1} has a unique Wg-predecessor in Qn \ {0, n — I}. Hence, assuming we 

have {3', n — 1} U Sw~^ we can add q since 

({!', 0, n — 1} U Swq^)wq = {3', n — 1} U S U {g}. 

2. Sets with 4': 

(a) ({3', n - 1} U S&-1 U {3})6 = {4', n - 1} U S U {!}, 

(b) ({3', n - 1} U S&-1 U {l})b = {4', n - 1} U S U {2}, 

(c) ({3', n - 1} U S&-1 U {2})b = {4', n - 1} U S U {3}, 

(d) ({3', n - 1} U S6-1 U {m - 2})b = {4', n - 1} U S U {4}, 

(e) ({3', n — 1} U Sb~^ U {q})b = {4', n — 1} U S U {g -I- I}, for q = 4,..., n — 2. 
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3. Sets with p', p = 5,..., m — 2: 


({(p — 1)', n — 1} U S{b'^ab) ^ U {3})b^ab = {p', n — 1} U S' U {!}, 

({(p — 1)', n — 1} U Sa~^ U {g — l})a = {p', n — 1} U S U {q} for g = 2,..., n — 2. 

4. Sets with (m — 1)': If 1 ^ S, then S has a c-predecessor in Qn \ {0, n — 1} and ({!', 0, n — 1} U 
Sc~^)c = {2', n — 1} U S U {!}. Thus we have 

({2', n — 1} U S{bab)~^ U {l})bab = {{m — 1)^ n — 1} U S U {!}, 

({2',n- 1}US5-1U{1})6= {(m- l)',n- 1}USU{2}, 

({(to — l)',n — 1} USa“^ Ujg — l})a = {(to — l)',n — ljUSU {g}, 

for g = 3,..., n — 2. 

5. Sets with 1': 

({3',n — 1}U Sa~^ U{n — 2})a = {l',0,n — 1}USU{1}, 

({3',n - 1} USa"^ U {l})a = {!', 0, n - 1} U S U {2}, 

({2', n - 1}U S(a2)-i U{l})a2 = {!', 0, n - 1} U S U {3}, 

({!', 0, n — 1} U S(a^)“^ U {g — 3})a^ = {!', 0, n — 1} U S U {g}, 

for g = 4,..., n — 2. 

6. Sets with 2': 


({!', 0, n — 1} U Sc ^)c = {2', n — 1} U S U {!}, as in Case 4, 

({!', 0, n — 1} U Sa~^ U{g—l})a = {2', n — 1} U S U {g}, 
for g = 2,..., n — 2. 

Thus the induction step goes through and all required sets are reachable. 

Cmorik and Jiraskova [231 Theorem 5] also found binary witnesses that meet the bound (to — 
1)2"“^ in the case where to — 2 and n — 2 are relatively prime. It remained unknown whether the 
bound (to — 1)2"“^ + 1 is reachable with a binary alphabet. We show in the Appendix that a slight 
modification of the first witness of [23] meets the upper bound exactly. 

For TO ^ 6, n ^ 3, let the first DFA be that of [23], except that the set of final states is changed to 
{2', 4'}; thus let A = {a, b}, V'^{a, b) = {Q'^, E, S', O', {2', 4'}), and let T>„(a, b) = {Q„, E, S, 0, {!}), 
with the transitions shown in Figure El Let L'^(a, b) and Ln(a, b) be the corresponding languages. 

Theorem 4 (Product: Binary Case). Form,n ^ 6, L'^{a,b) is suffix-free and L'^{a,b)-Ln{a,b) 
meets the bound (to — 1)2"'“^ + I when to — 2 and n — 2 are relatively prime. 

Proof. First we need to show that b) is minimal and suffix-free. For minimality, it is easy to 

verify that every pair of states is distinguished by a word of the form a'6. Suppose that L'^(a, b) is not 
suffix-free; then there are some words u, v such that v G b) and uv G L'm{a^ b). Since there is no 

transformation mapping 0' to itself, this means that v maps both 0' and g' = O'u G {!',..., (to — 2)'} 
to a final state. Clearly v = bw\ for some w and so g must be 2'. Then w maps I' and 2' to a final 
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Fig. 5. The witness DFA and Dn for product. The transitions to the empty states (m — 1)' and n — 1 
are omitted. 

state. Since these two states cannot be merged to any state other than {m — 1)', and b sends every 
state from {!',..., (m — 2)'} except 2' to {m — 1)', we have w = a\ But a® preserves the distance 
between the states in the cycle, and so 1' and 2' cannot be mapped simultaneously to 2' and 4'. 

The proof for meeting the bound (m — 1)2"'“^ + 1 is similar to that of [23l Theorem 5]. We 
construct the NFA for L'^{a,b)Ln{a,b) as usual. We show that {m — 1)2"“^ + 1 subsets of the 
states are reachable and distinguishable. Since a subset S without n — 1 cannot be distinguished 
from S' U {n — 1}, we consider only S as the representative for both S and S U {n — 1}. Similarly, 
a reachable subset S contains 0 if and only if it contains either 2' or 4'; we use only S \ {0} as the 
representative for these subsets. 

Let X C {1,..., n —2}. If X = 0, then clearly each of the m—1 subsets {p'} for p e {1,..., n —1} 
is reachable by baP~^. Let X ^ 0. To show that each of the (m — 2)(2"'“^ — 1) subsets S = {p'} U X 
for i G {!,..., gm- 2 } is reachable, we follow the proof of [23l Theorem 5] exactly. In this proof 
note that A' G F makes no difference, since b maps 4' to (m — I)', and so if applied to a subset 
containing 4', it results in a set with (m — 1)'. For the 2"“^ — I subsets S = {{m — I)'} UX consider 
S' = {!'} U X. Then S = S'b, and since S' is reachable, so is S. 

Finally, we show that all these (m — 1)2"“^ sets together with the initial state {O'} are distin¬ 
guishable. Set (O'l is distinguished from every other subset by bab. Consider {p'} U X and {q'} U Y 
for distinct p,q G (1,..., m — 2}, and some X,Y C (I,..., n — 2}. Then w = a"'~^b maps p' to 
2', and q' to (m — 1)'. Thus {{p'} U X)wb contains the final state 1, and ({g'} U X)wb does not. 
Consider {p'j U X and (q'} U Y with X ^ F; then X and Y differ in some r G {1,..., n — 2}. Thus 
^n-r-i distinguishes these subsets. 

Theorem 5. Suppose m,n ^ 4 and L'^Ln meets the bound 2"“^4-1. Then the transition semigroup 
Tn of a minimal DFA Dn of Ln is a subsemigroup o/T^^(n) and is not a subsemigroup o/T^®(n). 

Proof. Let = (Q^, X, F, O', F') with Q'^ = {O',..., (m - 1)'}, and let = {Qn,X,6,0,F) 
with Qn = {0, ■ ■ ■ ,n — 1}. We construct the NFA for L'^Ln as usual. 

From Lemma 9] we know that the set of reachable and distinguishable subsets of Q'^ U Qn 
can be represented by: {p'} U X for each p G {I,..., m — 1} and X C {I,..., n — 2}, or by {O'}. 
A reachable subset S contains 0 if and only if S' fl F' 7 ^ 0. Also S is not distinguishable from 
S U {(m — I)'} and S U {n — 1}. To reach the bound (m — 1)2"“^ + 1, all these subsets or their 
equivalents must be reachable. 
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Fig. 6. Witnesses T)'^[a,b, —) and 'Ds{—,b,a) for boolean operations. The empty states 8' and 7 and the 
transitions to them are omitted. 


Suppose that p,q € {l,...,n — 2} are not colliding in Qn- Consider S that contains both p 
and q. Then S is reached from some S' by a transformation t, where S' contains 0 and a state 
r € {l,...,n — 2} such that Ot = p and rt = q (or Ot = q and rt = p). But since p and q are not 
colliding, there is no such transformation in T„. Thus all pairs in Qn \ {0, n — 1} must be colliding 
and Tn is a subsemigroup of 

3.3 Boolean Operations 

Let V = I?(a, b, c) be the DFA of Definition [U and let L = L{a, b, c) be the language of this DFA. 
Consider the partial permutation 7 r'(a) = a, 7 r'( 6 ) = b, and 7 r'(c) = —. The dialect associated with 
tt' is 'D'{a,b,—), which is V(a,b,c) restricted to the alphabet {a, 6 }. This is our first witness for 
boolean operations, and we prime its states to distinguish them from those of the second witness 
defined below. This DFA is illustrated in Figure | 6 ] for m = 9. 

Now take the partial permutation tt, where 7r(a) = —, 7r(6) = b, and 7r(c) = a; here a plays the 
role of c. Thus V{—, b, a) is the DFA in which a: (0 ^ n — 1)(1,..., n — 2) and b: (2 —n — 1)(1 —>■ 
2)(0 —!> 1)(3,4), as illustrated in Figure |6] for n = 8. This DFA is our second witness for boolean 
operations. The language of D(—, b, a) is the dialect L{—, b, a) of L. 

Theorem 6. For m,n ^ 6 , L'^{a,b,—) and Lni—,b,a) meet the bounds mn — (m + n — 2) for 
union and symmetric differenee, mn — 2(m + n — 3) for interseetion, and mn — (m + 2n — 4) for 
differenee. 

Proof. As usual, we construct the direct product V of V!^ and I?„. The initial state of V is (0',0) 
which reaches only (!', 1) and can never be re-entered. Thus 0' and 0 do not appear in any reachable 
state of V other than (0',0). We show that the remaining (m — l)(n — 1) states are all reachable, 
for a total of 1 -I- (m — l)(n — 1) = mn — (m + n — 2) states. Let m define the row and n the column 
in the direct product. 

As a preliminary step we show how ((m — 2)',n — 2) can be reached. First we reach (4', 4) 
by ba'^b. If m = 6 , then (4',4)a'^“'^ = {4:',q) for q = 5,...,(n — 2). Otherwise, if m > n, then 
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(o', 0)ba^{ba)'^ = (5', 4), (5', = ((to— n+4)', 4), and (to— n+4,4)a”“® = ((to—2)', n—2). 

If TO = n, then (4', 4)a”“® = ((to — 2)', n — 2). Let m < n. Observe that it (n — m) = p (mod 3) for 
0 ^ p < 2, then ((3 — p)', 4)a”“’”6a'"“® = ((to — 2)', n — 2). Hence we need to reach ((3 — p)', 4). 
We consider the following three cases: 




1. (n — to) = 0 (mod 3). 

We reach (3', 4) by b'^a^ba, and then apply a" 

2. (n — to) = 2 (mod 3). 

We reach (l',4) by b^a^, and then apply 

3. (n — to) = 1 (mod 3). 

If TO ^ n — 4 then (!', 4)a^6o”“"'“^5a"'“® = ((to — 2)', n — 2). Otherwise to = n — I. If n < 12, 
then we have five cases, where (0',0)6a^ = (3', 3): 


(a) TO = 6, n = 7: 

(b) TO = 7, n = i 

(c) TO = 8, n = 9: 


I: i4',4)a^ba^ = (2', 4). 

1: (3',3)(a6)^o‘*5a^ = (2',4). 

): (3',3)a8 = (2',4). 

(d) TO = 9, n = 10: (3',3)a&a® = (2', 4). 

(e) TO = 10, n = 11: {3',3)aba‘^ba^baba^ba‘^ = (2',4). 

Otherwise we reach (2',4) as follows: (4',4)a’"“® = (4',n — 2), (4',n — 2)a’"“® = (4',n — 7), 
(4', n - 7)baba%a^ = (2', 1), (2', 1)0^ = (2', 4). 


Now, having reached ((to — 2)', n — 2), we can reach all the remaining pairs: We have ((to — 2)', n — 
2){ab)‘^b = ((to —1)', 3), and from ((to —1)', 3) all pairs ((to— 1)', q) for 1 ^ < n—1 can be reached, 

since Qn \ {0,n — 1} is a strongly connected component in Similarly, ((to — 2)',n — 2)ab^ = 
(4', n — 1), and all pairs (p', n — 1) for 1 ^ p ^ to — 1 can be reached. For the remaining pairs we 
proceed as follows: 


1. Column 3: 

(a) ((to — 2)', n — 2)a6a = (1', 3), 

(b) (0,0)6a3&= (2',3), 

(c) (0,0)6a2 = (3',3), 

(d) (2',3)a&=(4',3), 

(e) ((p — 1)', 3)ab = (p', 3) for p = 5,..., to — 2. 

2. Column 4: 

(a) (3',3)a = (l',4), 

(b) (3',3)&=(4',4), 

(c) ((p - 1)', 3)a = ip', 4) for p = 2, 3, 5,..., TO - 2. 

3. Column g for g = 5,..., n — 2: 

(a) i3',q-l)a = {l',q), 

(b) {3',q)b= i4',q), 

(c) ((p - 1)', q-l)a= ip', g) for p = 2, 3, 5,..., TO - 2. 

4. Column 1: 

(a) (0',0)&=(1',1), 

(b) ((TO-2)',n-2)a= (4',1). 

(c) Up — 1)', n — 2)a = ip', 1) for p = 2,3, 5,..., TO — 2. 

5. Column 2: 

(a) (3',l)a = (l',2), 

(b) (0',0)a2 = (2',2), 

(c) (3',1)&=(4',2), 
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(d) {{p - 1)', l)a = {p', 2) for p = 3, 5,..., m - 2. 


As we discussed in Subsection 13.21 for each x £ S every state q £ Qn \ {0, n — 1} has a unique 
predecessor state p £ Qn\ {n — 1}. It follows that ii qw = p £ Qn \ {0, n — 1}, for some state q 
and word w, then rw ^ p for r ^ q. The same facts apply to Q'^. We shall need the following two 
claims: 

Claim 1. From any pair {q' ,p) with — 2, — 2we can reach (!', 1). 

First we find a word w such that q'w £ {!', 2', 3'} and pw = 1. Note that it is sufficient to find u 
such that q'u £ {!', 2', 3'} and pu £ Qn \ {0, n — 1}, because then w = does the job. So 

if q' £ {l',2',3'}, then we are done. Otherwise use which maps q' to 4'. If pa™‘~^~'^ ^ 2, 

then we use b to map 4' to 3' and keep the state of in \ {0,n — 1}, and again we are done. 
Suppose = 2. If m = 6 then 4'a6 = 3' and pa'^~^~'^ab ^ 2, and we are done. Otherwise 

4 /^ 771-5 _ 4 'q,^q,"i -6 _ 4 '^ jf 2 a™“® ^ 2 then we can apply an additional b and be done. 

If 2a^~^ = 2, then 2aba^~^ = 3, and again by applying an additional b we are done. 

Now let q' £ {2', 3'}. We show that there is a word w such that q'w = 1 and Iw = 1. If 
q'a''~‘^ = 1' then we are done, and if q'a'^~‘^ £ {2', 3'} \ {<?'}, then q'{a''~'^Y = 1 and we have 
reached (!', 1). So assume q'a''~'^ = g'; thus (n — 2) = 0 mod 3, and so n ^ 8. 

If q' = 3' then (3', = (r', 1) with r' Y 3', so we are done by . If g' = 2' 

then (2', = (r', 1) with r' Y 2^ so we are done by 

Claim 2. For any q' £ Q'^ \ {O', (m — 1)'} and p £ Qn\ {0, (n — I)} there is a word such that 
q'w £ Q'^ \ (O', (to — 1)'} and pw = n — I. It follows that {q',p) can be mapped to (r', n — 1), for 
any state r' £ Qm \ {0^}- 

If q' £ {l',2',3'} then let Wi = a^b, ab, or b, respectively, and e otherwise; so Wi maps q' to 
{4',..., (to- 2)'}. Then let W 2 = so W 2 mapspwi to 2, and qwi is still in {4',..., (to —2)'}. 

Then w = wiW 2 b satisfies the claim. Since \ {O', (to — 1)'} is strongly connected, we can map 
{q'w, n - 1) to any r' £ Q'^\ {O'}. 

Now we consider the four cases of the operations. In all cases pair (O', 0) is distinguished as the 
only non-empty state which does not accept any word starting with a. 

1. Union and symmetric difference. 

Consider {q'^,pi) and {q' 2 ,P 2 ) with Y where 1 ^ ( 71,(72 ^ to — 1 and 1 ^ Pi,P 2 ^ n — 1. 
Without loss of generality, ( 7 i 7 ^^ (to — 1)'. The same arguments here work for both operations, 
since we are not mapping the pairs to (!', 1). By Claim 2 we can map {q'i,Pi) to (l,n. — 1) 
by some word w. Then q' 2 W Y 1- If ^Iso P 2 Y !> then (( 72 ,^ 2 ) is not final and w distinguishes 
our pairs. Otherwise we apply Claim 2 once more for ( 1 , 732 ), obtaining a word u such that 
(1, n — l)it = (1, n — 1), and (( 72 , l)u = (( 72 ^, n — 1). Since lu = u, we have q' 2 U Y 1, and so w'u 
distinguishes our pairs. 

Assume now ( 7 } = and pi Y P 2 , where pi 7 ^ n — 1. Let w be a word mapping pi to 1; then 
P 2 W Y 1- If Y l^ {qYp 2 )w = {q'i,P 2 )w is not final so w distinguishes our pairs. Otherwise 
wa^b'^ maps g( to (to — 1)', and pi to 4. Thus wa^b^a'^~^ maps pi to 1, and since p 2 is mapped 
elsewhere and g( = q '2 to (to — 1 )', our pairs are distinguished. 

2. Intersection. 

Consider (g{, 73 i) and ( 52 ,P 2 ) with 7 ^ q '2 or pi Y P 2 , where 1 qi,q 2 m — 2 and 1 pi,P 2 ^ 
n — 2. By Claim 1 we can map (g(,pi) to the final state (!', 1) by some word w. Then either 
q' 2 W Y 1^ or P 2 W Y I, {<iYp 2 ) is not final and our pairs are distinguished. Together with (0',0) 
and ((to — 1)', n — 1) these give 2 -f (to — 2)(n — 2) = mn — 2{m + n — 3) distinguished pairs. 
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3. Difference. 

Consider {q'i,pi) and {q 2 TP 2 ) with q'l ^ where 1 ^ 9i, <72 ^ m —2 and 1 ^ Pi,P 2 ^ n— 1. This 
follows in exactly the same way as the corresponding case of union and symmetric difference. 
Assume now q'l = q 2 and pi ^ P 2 - Without loss of generality, pi ^ n — 1. By Claim 1 we can 
map {q[,pi) to the non- final state (l^ 1) by some word w. Then {q 2 ,p 2 )w = {l',p 2 w), and 
since p 2 W ^ piw = 1, {l',p 2 'w) a final state. Thus w distinguishes our pairs. 

Together with (O', 0) and ((m — 1)', n — 1) these give 2 -|- (m — 2)(n — 1) = mn — (m + 2n — 4) 
distinguished pairs. 


4 Witnesses with Semigroups in T^®(n) 

We now turn to the operations which cannot have witnesses with transition semigroups in 

Definition 2. For n ^ 4, we define the DFA Vn{a,b,c,d,e) = {Qn, F!,6,0, F), where Qn = 
{0,..., n — 1}, E = {a, b, c, d, e}, <5 is defined by the transformations 

— a: (O^n — l)(l,...,n — 2), 

— b: (0^n-l)(l,2), 

— c: (0 —>■ n — l)(n — 2 —)• 1), 

— d: ({0,1} ^n - 1), 

— e: (Q \ {0} n — 1)(0 —^ 1), 

and F = {q & Qra\{0, n—1} | q is odd}. For n = 4, a and b coincide, and we can use E = {b,c,d,e}. 
The structure of 'D^{a,b,c,d,e) is illustrated in Figure}^ 

Our main result in this section is the following theorem: 

Theorem 7 (Boolean Operations, Reversal, Number and Complexity of Atoms, Syn¬ 
tactic Complexity). Let Vn^a, b, c, d, e) be the DFA of Definitions^ and let the language it accepts 
be Ln{a,b,c,d,e). Then Ln{a,b,c,d,e) meets the bounds for boolean operations, reversal, number 
and quotient complexity of atoms, and syntactic complexity as follows: 

1. For n,m 4, Lm(a, b, —, d, e) and Ln(b, a, —, d, e) meet the bounds mn — {m + n — 2) for union 
and symmetric difference, mn — 2{m -I- n — 3) for intersection, and mn — (m -|- 2n — 4) for 
difference. 

2. For n'^ 4, Lnifl, —, c, —, e) meets the bound 2"“^ -|- 1 for reversal and number of atoms. 

3. For n ^ 6, b, c, d, e) meets the bound (n — 1)"“^ -|- n — 2 for syntactic complexity, and the 

bounds on the quotient complexities of atoms. 

The claim about syntactic complexity is known from [20] . It was shown in m that the number 
of atoms of a regular language L is equal to the quotient complexity of L^. In the next subsections 
we prove the claim about boolean operations, reversal, and atom complexity. First we state some 
properties of X>n. 

Proposition 6. For n ^ 4 the DFA of Definitions^ is minimal, accepts a suffix-free language, and 
its transition semigroup Tn has cardinality (n — 1)"“^ -|- n — 2. In particular, Tn contains (a) all 
(n — 1)"“^ transformations that send 0 and n — 1 to n — 1 and map Q \ {0,n — 1} to Q \ {0}, 
and (b) all n — 2 transformations that send 0 to a state in Q\ {0, n — 1} and map all the other 
states to n — 1. Also, T„ is generated by {a,b,c,d,e} and cannot be generated by a smaller set of 
transformations. 
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Proof. To prove minimality, note first that only the initial state 0 accepts e, and only state n — 1 
accepts nothing. Suppose p,q € Qn \ {0,n — 1}, p < q, and p and q have the same parity; then 
pa^~‘^~Pc = 1, which is a final state, but pa^~^~'^c is an even state, which is non-final. If p and q 
have different parities, then one is final while the other is not. Hence 'Dn is minimal. 

The language is suffix-free, because every word in has the form ex with x € (a, &, c, d}. 
The claims about cardinality of T„, transformations, and generators were proved in |12I20] . 

4.1 Boolean Operations 

We now show that witness DFAs for boolean operations may have transition semigroups in 
Thus boolean operations have witnesses with transition semigroups that are subsemigroups of either 
or This is sharp contrast with all the other complexity measures: any witness for star 

and any second witness for product can only be associated with while the size of the syntactic 
semigroup, reversal, and complexities of atoms must have witnesses associated with T^®. 

Theorem 8. For n, to ^ 4, Lm{a, b, —, d, e) and Ln{b, a, —, d, e) meet the bounds mn — (to -|- n — 2) 
for union and symmetric difference, mn — 2 (to -|- n — 3) for intersection, and mn — (to -|- 2n — 4) 
for difference. 

Proof. Our two DFAs Vmia, b, —, d, e) and Vnib, a, —, d, e) are illustrated in Figure[7l consider their 
direct product. Let Jd = {((to — l)',g) — 2}, H = {(p',n — 1) | 1 ^ p ^ to — 2}, and 

M = {{p', q) I l^p^TO — 2,l^g^n — 2}. 

Let An{a, b) = (Qra\{0, n —1}, {a, b}, S, 1, F) be the DFA obtained from the DFA Vffa, b, —, d, e) 
by restricting the alphabet to {a, b} and the set of states to {1,..., n — 2}. Since DFAs A'^{a, b) and 
y4,i(5, a) have ordered pairs and {(5a,(5{,} of transformations that generate the symmetric 

groups of degrees to and n and are not conjugate, the result from [2 Theorem 1] applies, except in 
our case where to = 4 and n = 4 (we add two states to the to and n in m)- We have verified this 
case by computation. Therefore we know that all states in M are reachable from state (!', 1) and 
all pairs of such states are distinguishable. 

We show that (O', 0), ((to — 1)', n—1) and all states in H, V and M are reachable. State (O', 0) 
is the initial state, (!', 1) is reached from (O',0) by e and ((to — 1)',n—1) by a. By Theorem 1 of [2] 
all the states in M, and in particular (!', 2) and (2', 1), are reachable from (!', 1). From state (!', 2) 
we reach ((to — 1)', 2) by d and from there, state ((to — 1)', q) by a word in b* for 1 ^ q ^ n — 2. 
Symmetrically, from (2', 1) we reach (2', n — 1), and from there, state (p', n — 1) for 1 ^ p ^ to — 2 
by a word in a*. Hence all (to — l)(n — 1) -I- 1 = mn — (to -I- n — 2) states are reachable. 

State (0',0) is the only state accepting a word beginning with e. State ((to — l)',n — 1) is the 
only empty state. 

For union we consider six possibilities for the distinguishability of two states: 

1. States in H. If n is odd, and p', r' G \ {O', (n — 1)'} with p < r, where p' and r' are both final 

or both non-final, then r' accepts whereas p' rejects it. If n is even, r' accepts aP~‘^b 

whereas p' rejects it. Hence if ((to — l)',p) and ((to — 1)', r) are in FI, they are distinguishable. 

2. States in V. The argument is symmetric to that in FI. 

3. H and V. Consider {p',n — 1) G H and ((to — !)',(?) G FI. Note that (l',n — 1) accepts afb 
whereas ((to —1)', 1) rejects it. For every (p', q) there exists a permutation t induced by a word w 
in {a, b}* such that (!', l)t = (p', q). Then (p', q)t~^ = (!', 1). Hence (p', n — 1) and ((to — 1)', q) 
are mapped by t~^ to (l',n — 1) and ((to — 1)', 1), which are distinguishable. 
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Fig. 7. The DFAs 'D'g,{a,b, —,d,e) and 1 ) 7 ( 6 , a,d, e) for boolean operations. The empty states 8' and 7 
and the transitions to them are omitted. 


4. H and M. If {p',r) G M, then there exists a permutation t induced by a word w in {a,b}* 

such that {p',r)t = (l',2). The same t maps ((to — G H to some ((to — l)',s) with 

1 ^ s ^ n — 2. Then (!', 2)d = ((to — 1)', 2). If s 7 ^ 1, then sd G {2,..., n — 2), and we are done, 
since states ((to — l)^2) and ((to — l)^s) are in H and hence are distinguishable. Otherwise, 
((to — 1)', s)d = ((to — 1)', n — I) and again we are done. 

5. V and M. The argument is symmetric to that for H and M. 

6 . States in M. Apply Theorem 1 of [5]. 

For symmetric difference, we have as many states as for union. The arguments for distinguisha- 
bility are exactly the same as for union. 

For intersection, all states in iJ U F U {((to — l)',n — 1)} are empty, and all states in M are 
distinguishable by [5J Theorem 1]. Hence there are (to — 2)(n — 2) + 2 = mn — 2{m + n — 3) states 
altogether. 

For difference, all states in H U {((to — 1)', n — 1)} are empty, so this leaves (to — 2)(n — 1) + 2 = 
mn — (to + 2n — 4) states. The states in V are all distinct by the argument used for union. Also 
S M is distinguishable from ir',n — 1) S F, by the argument used for union. 

Since 5d = ^ci<5ci and 5e = SciSc 2 • • • where the Ci are from Proposition [5J the semigroup of 

'Dn{a, b,—,d, e) is in T'^®(n) n T^®(n). In fact, one can verify that the semigroup of Vnia, b,—,d, e) 
is 

4.2 Reversal 

Han and Salomaa [IS] showed that to meet the bound for reversal one can use the binary DFA of 
Leiss m and add a third input to get a suffix-free DFA. Cmorik and Jiraskova [53] showed that 
a binary alphabet will not suffice. We show a different ternary witness below, and prove that any 
witness must have its transition semigroup in 

Theorem 9 (Semigroup of Reversal Witness). For n ^ 4, the transition semigroup of a 
minimal DFA 2)„((5„, 17, <5, 0, F) of a suffix-free language that meets the bound 2"“^ -|- 1 for 
reversal is a subsemigroup o/T^®(n). 
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Proof. It suffices to show that no pair of states is colliding. 

We construct an NFA Af for by reversing the transitions of V, and interchanging the sets of 
final and initial states. We then determinize M using the subset construction to get a DFA TZ for 
L^. The states of TZ are sets of states of V. 

Consider a reachable subset S C Q^. We can assume that n — 1 ^ S, since n — 1 is not reachable 
from a start state in A/”. Also, if 0 S S' then S = {0}, as otherwise in V some transformation would 
map both 0 and (;S{l,...,n — l}toa final state so the language would not be suffix-free [2^ 
Lemma 6]. Hence, every subset S C {l,...,n — 2} must be reachable. 

Suppose that there are two distinct states p,q G {l,...,n — 2} such that the pair {p,g} is 
colliding. This means that OSw = P and q'Sw = q for some word w and q' G {1,... ,n — 2}. Since 
the subset {p,q} is reachable in T-pn, there is some such that = {p,q}. So p6f^ G F and 
qSf^ G F. But then wu G Ln, and also viuu G where v is such that = q', which contradicts 
that Ln is suffix-free. 

Theorem 10 (Reversal Complexity). Ifn'^A, then Ln{a,—,c,—,e) of Definition\^ meets the 
bound 2"“^ -|- 1 for reversal. 

Proof. As before, we construct NFA Af for - state n — 1 is now not reachable - and determinize 
Af to get the DFA TZ. We shall show that we can reach {0} and all subsets of Qn \ {0, n — 1}. 

The initial state of N consists of all the odd states in Qn \ {0, n — 1}. By applying e we reach 
the set {0}. 

Consider a subset S C \ {0, n — 1}. By Proposition|6](a) we have all transformations mapping 
Q\ {0, n — 1} into (5\ {0}; in particular, if n is odd, there is a transformation 5w mapping S into the 
initial subset {1,3,..., n — 4, n — 2} and Qn \ S' to {n — 1}. Thus S~^ maps {1, 3,..., n — 4, n — 2} 
onto S. If n is even, replace {1,3,..., n — 4, n — 2} by {1, 3,..., n — 5, n — 3}. 

Now we prove that these 2"“^ -|- 1 states (sets of states of V) are pairwise distinguishable. Set 
{0} is the only final state. Consider two sets i?, S C (5„ \ {0, n — 1} and suppose that q G R® S. 
Then a'^“^e is accepted from the set that contains q but not from other set. Therefore there are no 
equivalent states. 

Although Ln{a, —, c, —, e) meets the bound for the number of atoms, it does not meet the bounds 
on the quotient complexity of atoms. 


4.3 Complexity of Atoms in Sufflx-Pree Languages 

Let Qn = {0,...,n — 1} and let L be a non-empty regular language with quotient set K = 
{Kq, ... ,Kn-i}. Let V = {Qn, F,S,0,F) be the minimal DFA of L in which the language of 
state q is Kq. 

Denote the complement of a language L hy L = S* \L. Each subset S of Qn defines an atomic 
intersection As = Hies Hies Ki, where S = Qn \ S. An atom of L is a non-empty atomic 
intersection. Since atoms are pairwise disjoint, every atom A has a unique atomic intersection 
associated with it, and this atomic intersection has a unique subset S oi K associated with it. This 
set S is called the basis of A. 

Let As = n*es Ki C flies atom. For any w G S* we have 

w~^As = Pi w~^K., n p w-^Ki. 

ies igs 
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Since a quotient of a quotient of L is also a quotient of L, w ^^5 has the form: 

w-^As = f| if. n f| 

i&X i&Y 

where |X| ^ IS”] and |y| ^ n — IS”!, X,Y A Q^. 

Proposition 7. Suppose L is a suffix-free language with n ^ 4 quotients. Then L has at most 

2"“2 _|_ I atoms. Moreover, the complexity k{As) of atom As satisfies 

f<2"-2 + l, tfS = $; 

«^(-4s) >1 = ?T-, ifS = {0}; (2) 

i ^ 1 + (E') (”T") > M ^ C { 1 ,..., n - 2 }. 

Proof. If n — 1 S S, then As is not an atom. Also, Kq Ci Ki = 0 for i = 1,..., n — 1, since L is 

suffix-free. Hence 0 G S implies S = {0}. It follows that there are at most 2"“2 _(- 1 atoms. 

For atom complexity, consider the following cases: 

1. S' = 0. Here S = Qn, Afj^ = Cli^Qn ^ ^ ' w~^As = where 0 ^ F, 

since w~^L = L only if w = e, and n — 1 is always in Y since w~^Kn-i = ifn-i because 
Kn-i = 0. Thus we have the initial quotient Ag with 0 G S and at most 2"“2 choices for the 
other quotients of As. Hence the complexity of Ag is at most 2"“2 _(- 1 . 

2. S = {0}. Since the language is suffix-free, we have that ifo H ifg = 0 for all q G Q \ {0}. Thus 
Aq = ifo n (ifi) n ... n (Kn-i) = ifo, and Kq = Ln has n quotients. 

3. Since n — 1 always appears in S and the S = {0} case is done, there remain the cases where 
0 S C (5„ \ {0, n — 1}. Suppose (X,Y U {n — 1}) represents a quotient of As by a non¬ 
empty word w. If Odu, = n — I, then X must have at least one and at most IS"! elements from 
Qn \ {0, n — 1}. Since 0 appears only initially, Y cannot contain 0. So in Y there must be from 
0 to n — 2 — \X\ states from Qn \ ({0, n — 1} U X). Hence we have the formula from Equation!^ 
Suppose that OJu, ^ n — 1. The 0-path in Sw is aperiodic and ends in 0, so there exists a state 
P G Qm such that pS^j = n — 1. If p G S, then n — 1 G X, and so {X, Y) represents the empty 
quotient. If p G Q \ S', then again F \ {n — 2, n — 1} contains at most n — 2 — |S| states. 

Following Ivan [35] we define a DFA for each atom: 

Definition 3. Suppose V = {Q, E,d, qo, F) is a DFA and let S C Q. Define the DFA Vs = 

{Qs,F,A,{S,S),Fs), where 

- Qs = {(A,F) |X,FCQ,XnF = 0}U{T}. 

- For all aG E, {X,Y)a = {Xa,Ya) if XaCiYa = 0, and (X,Y)a = T otherwise; and Ya = T. 

- Fs = {(X,F) |XCF,FCF}. 

DFA Vs recognizes the atomic intersection A 5 of L. If Vs recognizes a non-empty language, then 

As is an atom. 

Theorem 11. For n 'Y 4, the language Ln{Vn{a,b,c,d,e)) of Definition meets the bounds of 

Proposition^ for the atoms. 

Proof. There are three cases to consider: 
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1. ^ = 0. The initial state of Vs is (0, Qn)- Since (0, Qn)ed = (0, {n — 1 }), which is a final state, 
Ag is an atom. For any non-empty word w, we have a quotient of A 0 represented by (0,T), 
where 0 ^ V, n — 1 € Y, and F \ {n — 1} is a subset of Qn \ {0, n — 1}. By Proposition | 6 ] the 
transition semigroup of V contains all transformations that send 0 and n — 1 to n — 1 and map 
Qn \ { 0 , n — 1 } into Qn \ { 0 }; hence all 2 "“^ quotients of by non-empty words are reachable. 
Together with A^ we have 2"“^ -|- 1 quotients of A^. 

Since (0, < 3 , 1)6 = (0, {l,n — 1}), which is non-final, and (0,F)e = (0, {n — 1}), which is final, 
(0,y) is distinguished from the initial state. We claim also that all pairs (0, F) and (0, F'), 
F ^ F', are distinguishable. Without loss of generality, suppose that q S F \ F'. If we map q 
to 1 and (F U F') \ {1} to n — 1 by a word w, then (0, Y)w = (0, {1, n — 1}), which is non-final, 
and (0, F')^ = (0, {n — 1}), which is final. Hence the bound 2"-“^ -|- 1 is met. 

2. S = {0}. The initial state of Vs is ({0},(3ra \ {0}). We have ({0},(3ra \ {0})a: = T for all 
X G S \ {e}, and ({0}, Qn \ {0})e = ({!}, {n — 1}). From ({!}, {n — 1}) we can reach any state 
({<?}, {"n — 1}) with g G (3ra \ {0, n — 1}. Thus we can reach n states altogether: the initial state, 
n — 2 states ({<?}, {n — 1}) with q G Qn \ {0,n — 1}, and T. 

Since ({0}, <3n\{0})e = ({!}, {n—1}), which is final, the initial state is non-empty and so differs 
from T. For q G Qn \ {0, n— 1}, consider the word w that maps g to 1 and Qn \ {0, q} to {n — 1}. 
Then ({g}, {n — l})ry = ({!}, {n — 1}), which is final, while ({0}, Qn \ {0})ry = {{n — 1}, F), 
which is non-final since F contains n — 1. Hence ({<?}, {n — 1}) is distinguishable from the initial 
state and T. Finally, for p, g G (3ra\{0, n —1}, {{p}, {n —1}) is distinguishable from {{q}, {n —1}) 
by the mapping that sends p to 1 and q to n — 1. Hence the bound n is met. 

3. 0 ^ 5” C Qn \ {0, n — 1}. The initial state here is (S', S), where 0 ^ S C {1,..., n — 2}. Since we 
can map S to {1} and S to {n — 1}, each such intersection is an atom. If M = {1,..., n — 2}, 
the atom has the form (S, S) = (S, (M \ S) U {0,n — 1}). By applying e, we get T. To get a 
state of Hs different from T, we can map S to any non-empty subset X of M of cardinality 
|X|, where 1 < |X| < |S|; we can map M \ S to any subset F of (M U{n — 1}) \X of cardinality 
|F|, where 0 ^ |F| ^ \M\ — |X|; and we can map 0 and n — 1 to n — 1. Each such intersection 
{X, F U {n — I}) represents a non-empty quotient of by a non-empty word because we can 
map X to {1} and F to {n — I}. Hence we have the formula in Equation [5] for the number of 
reachable states of Vs- 

Now consider two states {X, F U {n — 1}) and {X', Y'U{n — I}), where one of these two states 
could be the initial one. If X ^ X', assume that q G X'\X. Let w map X to {1} and Qn \X to 
n — 1. Then {X, Y U{n — I})w = ({I}, {n — I}), while the first component of {X', F'U{n — I})w 
contains n — 1, meaning that that state is T; so these states are distinguishable. A symmetric 
argument applies if g £ A \ A'. 

This leaves the case where A = A'. Then A is disjoint from both F and F'. If g G F' \ F, let 
w map A to {!}, F U {0} to {n — 1}, and F' \ F to {!}. 

Then (A, F U {n — l})u> = ({!}, {n — 1}), while (A, F' U {n — l})u> = ({I}, {1, n — I}) = T, 
and we have distinguishablity. A symmetric argument applies if g G F \ F'. 

Theorem 12. The transition semigroup Tn of a minimal DFA A, i5,0, E) with n ^ 4 of 

a suffix-free language Ln that meets the bounds for atom complexities from Proposition is a 
subsemigroup o/T^®(n) and is not a subsemigroup o/T^®(n). 

Proof. It is sufficient to show that every pair of states p, g G {I,...,n — 2} is focused by some 
transformation from T„. 

Let p, g G {1,..., n — 2} be two distinct states. Consider the atom A^p ^y. From the proof of 
Proposition^ to meet the bound for the quotient complexity of Ayp ^y every pair (A, F U {n — I}) 
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must represent a quotient of the atom by a non-empty word w, where X^Y C {1,..., n — 2} are 
disjoint, 1 ^ |X| ^ 2, and |F| ^ n — 2 — |X|. In particular, when \X\ = 1 we know that the 
transformation of w maps both p and q to the state from X, and so p and q are focused. 

Remark 1. The complexity of atoms in left ideals [9] is 


I = n, a S = Qn\ 

At(kls) i 2"-i, if 5 = 0; 

1^1 + E”=i"' {V) {^--7) > otherwise. 

The formula for S ^ {0, Qn} evaluated for n — 1 and 5 C {1,..., n — 2} becomes 



which is precisely the formula for suffix-free languages. 

Tables [T] and [5] show the quotient complexities of atoms for small n. 


(3) 


Table 1. Suffix-free atom complexity. The entries from left to right are suffix-free language, left ideal, 
regular language. The * stands for “not applicable”. 


n 

1 

2 

3 

4 

5 


|5| = 0 

*/l/l 

*/2/3 

*/4,/7 

5/8/15 

9/16/31 


|5| = 1 

Vi/1 

*121^ 

>k/5/10 

5/13/29 

13/33/76 


|5| = 2 


*/2/3 

*/4/10 

4/16/43 

16/53/141 


|5|=3 



>k/3/7 

*/8/29 

8/43/141 


|5|=4 




*/4/15 

>k/16/76 


|5|=5 





*/5/31 


max 

Vi/1 

>i</2/3 

>i</5/10 

5/16/43 

16/53/141 


ratio 

- 

*/2.00/3.00 

>k/2.50/3.33 

>k/3.20/4.30 

3.20/3.31/3.28 



5 Conclusions 

It may appear that the semigroup T^®(n) should not be of great importance, since it exceeds 
T^^{n) only for n = 4 and n = 5, and therefore should not matter when n is large. However, our 
results show that this is not the case. We conclude with our result about the non-existence of single 
universal suffix-free witness. 
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Table 2. SufRx-free atom complexity continued. 


n 

6 

7 

8 

9 

I'S'l = 0 

17/32/63 

33/64/127 

65/128/255 

129/256/511 

I'S'I = 1 

33/81/187 

81/193/442 

193/449/1, 017 

449/1,025/2,296 

I'S'l = 2 

53/156/406 

156/427/1,086 

427/1,114/2, 773 

1,114/2,809/6, 859 

IS"! = 3 

43/166/501 

166/542/1,548 

542/1,611/4, 425 

1,611/4,517/12,043 

IS"! = 4 

16/106/406 

106/462/1, 548 

462/1,646/5,083 

1,646/5,245/15,361 

IS"! = 5 

'x/32/187 

32/249/1, 086 

249/1, 205/4, 425 

1,205/4, 643/15,361 

IS"! = 6 

>k/6/63 

*/64/442 

64/568/2, 773 

568/3,019/12,043 

IS"! = 7 


*/7/127 

>k/128/1,017 

128/1,271/6,859 

IS"! = 8 



>i</8/255 

*/256/2,296 

IS"! = 9 




'x/9/511 

max 

53/166/501 

166/542/1,548 

542/1, 646/5, 083 

1,646/5,245/15,361 

ratio 

3.31/3.13/3.55 

3.13/3.27/3.09 

3.27/3.04/3.28 

3.04/3.19/3.02 


Theorem 13. For n ^ 4, there does not exist a most complex suffix-free language. 

Proof. If there exists a most complex DFA 'D{Qn, S,S,0, F), then in particular it would have to 
meet the bound for reversal and star. From Theorem M the transition semigroup of V must be a 
subsemigroup of T^®(n), but from Theorem!^ we know that it cannot be a subsemigroup of 
Consequently no most complex suffix-free language exists for n ^ 4. 

The first four studies of most complex languages were done for the classes of regular languages [6] , 
right ideals mm. left ideals |yll()llb| . and two-sided ideals mm- In those cases there exists 
a single witness over a minimal alphabet which, together with its permutational dialects, covers 
all the complexity measures. In the case of suffix-free languages such a witness does not exist. Our 
study is an example of a general problem: Given a class of regular languages, find the smallest set of 
witnesses over minimal alphabets that together cover all the measures. The witness of Definition [T] 
is conjectured to be over a minimal alphabet, unless the bound for product can be met by binary 
DFAs for every n,m > c, for some c; this is an open problem. The witness of Definition [5] is over a 
minimal alphabet, since five letters are required to meet the bound for syntactic complexity. 
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